Quantification of glottal and voiced speech harmonics- to-noise ratios using cepstral-based estimation
نویسندگان
چکیده
Cepstral analysis is used to estimate the harmonics-to-noise ratio (HNR) in speech signals. The inverse Fourier transformed liftered cepstrum approximates a noise baseline from which the harmonics-to-noise ratio is estimated. The present study highlights the manner in which the cepstrum-based noise baseline estimate is obtained, essentially behaving like a moving average filter applied to the power spectrum for voiced speech. As such, the noise baseline, which is taken to approximate the noise excited vocal tract, is also shown to be influenced by the window length and the shape of the glottal source spectrum. Two existing estimation techniques are tested systematically for the first time using synthetically generated glottal flow and voiced speech signals, with a priori knowledge of the HNR. The source influence is removed using preemphasis to obtain an improved noise baseline fit. The results indicate accurate HNR estimation using the new approach.
منابع مشابه
A cepstrum-based harmonics-to-noise ratio in voice signals
A new cepstrum-based technique is developed in order to provide an alternative means of estimating the harmonics-tonoise ratio in voice signals. The geometric mean harmonics-tonoise ratio (GHNR) is defined as the mean of the individual spectral (i.e. at specific frequency locations) harmonics-tonoise ratios in dB. A heuristic development of the method treats the harmonic spectrum (in dB) of voi...
متن کاملA Neurally Motivated Technique for Voicing Detection and F 0 Estimation for Speech
Speech consists of alternating voiced and unvoiced sections. Voiced speech consists of multiple harmonics of some fundamental (F0); unvoiced speech consists of silence, or ltered noise. Here, speech is wideband bandpass ltered into many bands (modelling the cochlea). Each lter output is rectiied (modelling the organ of Corti hair cell action), and bandpass ltered by convolution with the diieren...
متن کاملEvaluation of cepstrally derived harmonics-to-noise ratios in voice signals
The harmonics-to-noise ratio (HNR) has been shown to provide a useful, non-invasive index for assessing and analyzing voice disorders. As such, a valid and reliable method for estimating levels of noise and hence HNR, in human voice is important for effective evaluation and management of voice pathology. An objective method to determine the accuracy of two cepstrum-based techniques described in...
متن کاملRobust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech
This paper presents a robust feature extraction method effective to speech signal with high fundamental frequency and/or corrupted by additive white noise. The method represents the glottal source wave using HMM in order to model the nonstationary properties. The nodes of HMM are concatenated in a ring state to represent the periodicity of voiced sounds. The method can accurately extract glotta...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کامل